ABSTRACT


Let $X_1, X_2, \ldots, X_r$ be a random sample of size $r$ from a binomial
distribution $b(n, p)$, and let $x_1, x_2, \ldots, x_r$ be the $r$ observed
success counts. A method has been developed to estimate the total number of
trials $n$ from a Bayesian perspective when the probability of success $p$ is
either known or unknown. The prior distribution for $n$ is assumed to be the
discrete uniform distribution. When $p$ is unknown, $p$ is assumed to have a
beta prior distribution. The estimate of $n$ is then the mode of the posterior
distribution. Guidelines for selecting the shape parameters of the beta
distribution are also discussed.
INTRODUCTION


Suppose that $X_1, X_2, \ldots, X_r$ is a random sample of size $r$ from a
binomial distribution $b(n, p)$, where $n$ is the number of trials and $p$ is
the probability of success. Given $r$ observations $x_1, x_2, \ldots, x_r$, the
usual problem in the binomial situation is to estimate $p$. In other instances,
however, $n$, the number of trials, may be the unknown parameter of interest.
For example, an appliance company may be interested in estimating the number of
appliances of a particular type in a given service area. In the naval
environment, when a number of ships have been located or detected in a certain
region, it is sometimes important to estimate the overall number of ships in
that region in order to decide on appropriate actions. These are some of the
practical problems in which $n$ in a binomial distribution becomes the
parameter to be estimated.

In the simplest case, when $r = 1$ and $p$ is known, a procedure for estimating
$n$ by constructing confidence intervals was worked out by Tang and Sindler in
[1]. This research is a follow-up in which some of those restrictions are
removed: $r$ is assumed to be a positive integer greater than one, and $p$ may
or may not be known. Draper and Guttman [2] proposed estimating $n$ in a
Bayesian framework by assigning prior probability distributions to the unknown
parameters; the estimation of $n$ is then based on the posterior distribution.
Because prior probability distributions are assumed, more unknown parameters
become involved, but [2] does not address how to select their values. This
research memorandum recommends some guidelines for selecting the values of the
parameters in the prior probability distributions. Recurrence formulas for
calculating the posterior probabilities are also derived.

The method of estimating $n$ from a Bayesian viewpoint is outlined in the next
section, which also discusses how to choose values of the unknown parameters in
the prior probability distributions and presents several examples. Remarks and
directions for future research are given in the last section. Characteristics
of beta distributions and an interactive computer program are included in the
appendixes.


THE  METHOD 

Let $X_1, X_2, \ldots, X_r$ be a random sample of size $r$ from a binomial
distribution $b(n, p)$. Given the observations $x_1, x_2, \ldots, x_r$, the
objective is to estimate $n$. When $r = 1$ and $p$ is known, the method of
constructing confidence intervals for $n$ has been presented in [1]. For $r$ a
positive integer larger than one, Draper and Guttman [2] proposed a Bayesian
approach for estimating $n$. Adopting their notation, the likelihood can be
written as:


$$ L(n, p \mid x) \propto p^{t} (1-p)^{rn-t} \prod_{i=1}^{r} \binom{n}{x_i} , \tag{1} $$

where $x = (x_1, x_2, \ldots, x_r)'$ is a column vector of positive integers
and $t = \sum_{i=1}^{r} x_i$ is the total number of successes in the $r$
observations. The case of $p$ being known will be discussed first; then, the
case of $p$ being unknown will be discussed.


p  is  Known 


When  p  is  known,  let  h(n)  denote  the  prior  distribution  of  n.  Without  further 
knowledge  of  n,  the  discrete  uniform  distribution  provides  a  reasonable  form  for  h(n). 

$$ h(n) = \begin{cases} 1/N , & 1 \le n \le N , \\ 0 , & \text{elsewhere} , \end{cases} \tag{2} $$

where  N  is  a  predetermined  large  integer.  The  posterior  distribution  for  n  is  given  by 


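$$ p(n \mid x, p) \propto (1-p)^{rn} \, h(n) \prod_{i=1}^{r} \binom{n}{x_i} . \tag{3} $$

The domain of $p(n \mid x, p)$ is the set of $n$ such that
$n = x_{\max}, x_{\max}+1, x_{\max}+2, \ldots, N$, where

$$ x_{\max} = \max(x_1, x_2, \ldots, x_r) . $$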

The mode of the posterior distribution $p(n \mid x, p)$ given in expression 3,
denoted by $\hat{n}$, provides an estimate of $n$. $\hat{n}$, therefore, is the
integer satisfying the following inequalities:

$$ p(\hat{n}-1 \mid x, p) \le p(\hat{n} \mid x, p) , $$

and

$$ p(\hat{n}+1 \mid x, p) \le p(\hat{n} \mid x, p) . $$


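Or alternatively, because the ratio of successive posterior probabilities is
$p(n \mid x, p) / p(n-1 \mid x, p) = [n(1-p)]^{r} / \prod_{i=1}^{r} (n - x_i)$,
$\hat{n}$ is the solution of the simultaneous inequalities:

$$ [\hat{n}(1-p)]^{r} \ge \prod_{i=1}^{r} (\hat{n} - x_i)
   \quad \text{and} \quad
   [(\hat{n}+1)(1-p)]^{r} \le \prod_{i=1}^{r} (\hat{n} + 1 - x_i) . $$

It has been pointed out by Feldman and Fox [3] that $\hat{n}$ is also the
maximum likelihood estimator for $n$.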
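The simultaneous inequalities can be checked by a direct integer search. The
following Python sketch is illustrative only; the function name
`posterior_mode_known_p` and the search bound `n_limit` are assumptions
introduced here and do not appear in the memorandum.

```python
from math import prod


def posterior_mode_known_p(x, p, n_limit=10_000):
    """Return the smallest n >= max(x) that satisfies both mode inequalities.

    The two conditions are
        [n (1 - p)]^r        >= prod_i (n - x_i)
        [(n + 1) (1 - p)]^r  <= prod_i (n + 1 - x_i)
    `n_limit` is a hypothetical safety bound on the search.
    """
    r = len(x)
    q = 1.0 - p
    for n in range(max(x), n_limit + 1):
        left_ok = (n * q) ** r >= prod(n - xi for xi in x)
        right_ok = ((n + 1) * q) ** r <= prod(n + 1 - xi for xi in x)
        if left_ok and right_ok:
            return n
    raise ValueError("no mode found up to n_limit")


# Data of example 2 below: four success counts with p = 0.2.
mode_example_2 = posterior_mode_known_p([4, 8, 12, 8], 0.2)
```

When two adjacent integers have equal posterior probability, this search
returns the smaller one.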

In addition to providing an estimate for $n$, the posterior distribution could
also cast some light on the precision of the estimate. A closed form of the
estimator for $n$ may not seem feasible. But a numerical solution can be
obtained by using the following recurrence formula. For $n = x_{\max} + j$,
$j = 0, 1, 2, \ldots, (N - x_{\max})$,


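$$ p(n \mid x, p) = p(x_{\max} + j \mid x, p) = C \cdot Q_j , \tag{4} $$

where

$$ Q_j = \begin{cases}
     1 , & \text{if } j = 0 , \\
     Q_{j-1} \, (1-p)^{r} \, \dfrac{(x_{\max} + j)^{r}}{\prod_{i=1}^{r} (x_{\max} + j - x_i)} , & \text{otherwise} .
   \end{cases} \tag{5} $$

Thus, the normalizing constant $C$ in expression 4 is the reciprocal of the sum
of the $Q_j$'s, i.e.,

$$ C = 1 \Big/ \sum_{j=0}^{N - x_{\max}} Q_j . \tag{6} $$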
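As a minimal sketch of how the recurrence and the normalizing constant might be
coded, the Python function below builds the posterior for a given $N$. The
name `posterior_known_p` and the dictionary return type are illustrative
assumptions; the program in appendix B is organized differently.

```python
def posterior_known_p(x, p, N):
    """Posterior p(n | x, p) for n = x_max, ..., N under the uniform prior.

    Follows expressions 4 to 6: Q_0 = 1, each later Q_j is obtained from
    Q_{j-1}, and C is the reciprocal of the sum of the Q_j.
    """
    r = len(x)
    x_max = max(x)
    q = [1.0]  # Q_0
    for j in range(1, N - x_max + 1):
        n = x_max + j
        denom = 1.0
        for xi in x:
            denom *= n - xi  # product of (x_max + j - x_i), never zero here
        q.append(q[-1] * ((1.0 - p) * n) ** r / denom)
    c = 1.0 / sum(q)  # normalizing constant C
    return {x_max + j: c * qj for j, qj in enumerate(q)}


# Usage: posterior for four counts with p = 0.2; N = 120 is an arbitrary choice.
post = posterior_known_p([4, 8, 12, 8], 0.2, 120)
n_hat = max(post, key=post.get)  # posterior mode
```

Working with the ratios $Q_j / Q_{j-1}$ avoids evaluating large binomial
coefficients directly.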

As far as point estimation is concerned, an estimate for $n$ can be obtained
irrespective of the predetermined integer $N$. If a confidence interval with a
specified confidence coefficient $\gamma$ is desired, then the value of $N$ is
needed. A $100\gamma$-percent confidence interval for $n$ is given by

$$ [\, x_{\max} + \ell , \; x_{\max} + u \,] , \tag{7} $$

where $\ell$ and $u$ are integers such that

$$ \sum_{j=\ell}^{u} p(x_{\max} + j \mid x, p) = \gamma . \tag{8} $$

Since $\ell$ and $u$ in expression 7 must be integers, condition 8 can generally
be met only approximately: they are chosen such that the summation on the
left-hand side of 8 is approximately equal to $\gamma$ and $1 - \gamma$ is
divided roughly equally between the two tails. Therefore, a $100\gamma$-percent
confidence interval for $n$ may not be unique.
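One possible way to pick $\ell$ and $u$ is sketched below in Python. The rule
used here, trimming at most $(1-\gamma)/2$ of posterior mass from each tail, is
an assumption made for illustration; the text only requires that the two tails
be roughly equal, so other choices are equally admissible. The helper name
`central_interval` and the demonstration posterior are likewise hypothetical.

```python
from math import comb


def central_interval(post, gamma=0.95):
    """Trim at most (1 - gamma) / 2 of posterior mass from each tail.

    `post` maps each candidate n to its posterior probability.
    Returns (lower bound, upper bound, achieved coverage).
    """
    tail = (1.0 - gamma) / 2.0
    ns = sorted(post)
    lo, hi = 0, len(ns) - 1
    acc = 0.0
    while lo < hi and acc + post[ns[lo]] <= tail:
        acc += post[ns[lo]]
        lo += 1
    acc = 0.0
    while hi > lo and acc + post[ns[hi]] <= tail:
        acc += post[ns[hi]]
        hi -= 1
    coverage = sum(post[n] for n in ns[lo:hi + 1])
    return ns[lo], ns[hi], coverage


# Demonstration posterior: one success count x = 10, p = 0.2, truncated at
# N = 81, computed directly from expression 3.
weights = {n: comb(n, 10) * 0.8 ** n for n in range(10, 82)}
total = sum(weights.values())
demo_post = {n: w / total for n, w in weights.items()}
low, high, cover = central_interval(demo_post)
```

Because $n$ is discrete, the achieved coverage returned by this rule is at
least $\gamma$ but rarely exactly $\gamma$.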




To determine a suitable value of $N$ for computing a confidence interval for
$n$, one may adopt the scheme given below. Let

$$ S_j = \sum_{i=0}^{j} Q_i \tag{9} $$

be the $j$th partial sum of the sequence $Q_0, Q_1, Q_2, \ldots$. For a given
$\delta > 0$, $j$ is defined to be the smallest integer such that

$$ Q_j / S_j < \delta . \tag{10} $$

Therefore, the required integer $N$ is equal to $x_{\max} + j - 1$. The
criterion stated in the inequality 10 suggests that the posterior
probabilities, beyond the value of $N$, will not contribute significantly.
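The truncation rule can be sketched in Python as follows for the case of known
$p$. The function name `truncation_point` and the guard `j_limit` are
illustrative assumptions, not part of the memorandum.

```python
def truncation_point(x, p, delta=0.005, j_limit=100_000):
    """Return N = x_max + j - 1, where j is the smallest index with Q_j / S_j < delta.

    Q_j follows the known-p recurrence (expression 5) and S_j is the running
    partial sum (expression 9). `j_limit` is a hypothetical safety bound.
    """
    r = len(x)
    x_max = max(x)
    q_j = 1.0  # Q_0
    s_j = 1.0  # S_0
    for j in range(1, j_limit + 1):
        n = x_max + j
        denom = 1.0
        for xi in x:
            denom *= n - xi
        q_j *= ((1.0 - p) * n) ** r / denom
        s_j += q_j
        if q_j / s_j < delta:
            return x_max + j - 1
    raise ValueError("criterion 10 not met within j_limit terms")


# One success count of 10 with p = 0.2 and delta = 0.005 (the setting of example 1).
N_example = truncation_point([10], 0.2)
```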

Example 1:  r = 1

Suppose $p$ is known to be 0.2. The only success count shows that ten successes
have been detected, i.e., $x_1 = 10$. Hence, $x_{\max}$ is also equal to 10. An
estimate for $n$ could be either 49 or 50. Similarly, 48 and 51 are also likely
values for the estimate. Since $x_1 = 10$ and $p = 0.2$, $x_1/p = 10/0.2 = 50$.
The possible estimates are all close to 50.

Using the criterion 10 for $\delta = 0.005$, $N$ is found to be 81. A 95-percent
confidence interval for $n$ is [30, 77]. As mentioned earlier, the confidence
coefficient, 95 percent, is only an approximation.


Example 2:  r = 4

Assume again $p = 0.2$. Four success counts are available: 4, 8, 12, and 8.
Hence, $x_{\max} = 12$. An estimate for $n$ is equal to 40. Other possible
values are 39 and 41.

For $\delta = 0.005$, a 95-percent confidence interval can be chosen to be
[31, 54] or [26, 51]. Both intervals give a confidence coefficient of only
approximately 95 percent.

p is Unknown


When $p$ is unknown, assume that $n$ and $p$ are independent. Let $n$ have the
same prior probability distribution as stated in expression 2. Suppose that the
prior probability distribution for $p$ is in the form of a beta distribution
with parameters $v_1$ and $v_2$. Let $k(p)$ denote the prior probability
distribution of $p$. Thus,

$$ k(p) \propto p^{v_1 - 1} (1-p)^{v_2 - 1} , \qquad 0 \le p \le 1 . \tag{11} $$

Differentiating $k(p)$ in expression 11 with respect to $p$ and setting the
first derivative equal to zero gives equation 12, which relates the value of
$p$ at which the prior density is maximized to the two unknown parameters,
$v_1$ and $v_2$, in the beta prior distribution:

$$ (1-p)\,v_1 - p\,v_2 - 1 + 2p = 0 . \tag{12} $$

With an initial estimate of $p$, $v_1$ and $v_2$ can be solved through
equation 12. However, the solution for $v_1$ and $v_2$ is not unique. If the
initial estimate of $p$ was obtained with high certainty, then the parameters
$v_1$ and $v_2$ should be chosen with larger values, e.g., $v_1 = 10$ or 20.
Otherwise, use smaller values; for example, $v_1$ may be 2 or 3. This
recommendation is based on the characteristics of beta distributions. Beta
distributions are further discussed in appendix A.
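Once an initial estimate of $p$ and a value of $v_1$ are chosen, equation 12
fixes $v_2$. A minimal Python sketch of that step follows; the function name
`beta_shape_v2` and its argument names are assumptions introduced here.

```python
def beta_shape_v2(p0, v1):
    """Solve (1 - p) v1 - p v2 - 1 + 2p = 0 for v2, with p set to p0."""
    if not 0.0 < p0 < 1.0:
        raise ValueError("p0 must lie strictly between 0 and 1")
    return ((1.0 - p0) * v1 - 1.0 + 2.0 * p0) / p0


# With an initial estimate p0 = 0.2, increasing v1 expresses more certainty.
pairs = [(v1, beta_shape_v2(0.2, v1)) for v1 in (2, 5, 10, 20)]
```

For an initial estimate of 0.2 this gives the pairings $(v_1, v_2)$ = (2, 5),
(5, 17), (10, 37), and (20, 77).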

The  joint  posterior  distribution  is  given  by 

$$ p(n, p \mid x) \propto p^{\,t + v_1 - 1} (1-p)^{\,rn - t + v_2 - 1} \, h(n) \prod_{i=1}^{r} \binom{n}{x_i} . \tag{13} $$

The marginal distribution of $n$ can be obtained by integrating expression 13
with respect to $p$ from 0 to 1. Therefore,

$$ p(n \mid x) \propto \frac{(rn - t + v_2 - 1)!}{(rn + v_1 + v_2 - 1)!} \prod_{i=1}^{r} \frac{n!}{(n - x_i)!} , \qquad \text{for } x_{\max} \le n \le N . \tag{14} $$


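Again, the mode $\hat{n}$ of expression 14 would provide an estimate of $n$.
Similarly, if $n = x_{\max} + j$, $j = 0, 1, 2, \ldots, N - x_{\max}$,

$$ p(n \mid x) = p(x_{\max} + j \mid x) = C \cdot Q_j , \tag{15} $$

where

$$ Q_j = \begin{cases}
     1 , & \text{if } j = 0 , \\
     Q_{j-1} \displaystyle\prod_{i=0}^{r-1} \left[ \frac{r x_{\max} - t + v_2 + (j-1)r + i}{r x_{\max} + v_1 + v_2 + (j-1)r + i} \right] \frac{(x_{\max} + j)^{r}}{\prod_{i=1}^{r} (x_{\max} + j - x_i)} , & \text{otherwise}
   \end{cases} $$

is a recurrence formula for calculating $Q_j$. The normalizing constant $C$ in
expression 15 can be computed in exactly the same manner as before.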
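The recurrence for the unknown-$p$ case can be sketched in Python along the
same lines. The function name `posterior_unknown_p` and the choice $N = 200$ in
the usage lines are illustrative assumptions.

```python
def posterior_unknown_p(x, v1, v2, N):
    """Marginal posterior p(n | x) for n = x_max, ..., N.

    Uses the uniform prior on n and a beta(v1, v2) prior on p, following
    expression 15 and the recurrence for Q_j that accompanies it.
    """
    r = len(x)
    t = sum(x)
    x_max = max(x)
    a = r * x_max - t + v2   # offset of the numerator terms
    b = r * x_max + v1 + v2  # offset of the denominator terms
    q = [1.0]  # Q_0
    for j in range(1, N - x_max + 1):
        n = x_max + j
        ratio = 1.0
        for i in range(r):
            ratio *= (a + (j - 1) * r + i) / (b + (j - 1) * r + i)
        for xi in x:
            ratio *= n / (n - xi)
        q.append(q[-1] * ratio)
    c = 1.0 / sum(q)  # normalizing constant C
    return {x_max + j: c * qj for j, qj in enumerate(q)}


# Usage: four success counts with a beta(2, 5) prior on p.
post_u = posterior_unknown_p([4, 8, 12, 8], 2, 5, 200)
n_hat_u = max(post_u, key=post_u.get)
```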


Example 3:  r = 1

Suppose $p$ is unknown. An initial estimate of $p$ is found to be 0.2. Assume
that the only success count gives ten successes. If $\delta$ in criterion 10 is
chosen to be 0.005, then the estimates at various levels of certainty are
presented in table 1.


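TABLE 1

ESTIMATES OF n WHEN r = 1 AND p IS UNKNOWN

Shape parameters         Estimate of n    N              95% confidence interval

v_1 = 2,  v_2 = 5        29 or 30         [illegible]    [17, 102]
v_1 = 5,  v_2 = 17       41 or 42         [illegible]    [23, 96]
v_1 = 10, v_2 = 37       45 or 46         [illegible]    [26, 89]
v_1 = 20, v_2 = 77       47 or 48         [illegible]    [27, 83]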


When the initial estimate of $p$ is made with high certainty, such as
$v_1 = 20$ and $v_2 = 77$, the point estimate for $n$ is almost identical to
the result given in example 1, in which $p$ is known. However, with $p$
unknown, confidence intervals are not as tight.


Example 4:  r = 4


Consider the set of data given in example 2. Assume that an initial estimate
for $p$ is 0.2 and $\delta = 0.005$. The results are summarized in table 2.


TABLE 2

ESTIMATES OF n WHEN r = 4 AND p IS UNKNOWN

Shape parameters         Estimate of n    N      95% confidence interval

v_1 = 2,  v_2 = 5        29               103    [17, 97]
v_1 = 5,  v_2 = 17       35               85     [22, 81]
v_1 = 10, v_2 = 37       38               74     [24, 69]
v_1 = 20, v_2 = 77       39               67     [26, 62]


CONCLUSIONS  AND  REMARKS 

In this research, a procedure has been developed for estimating $n$ in a
binomial distribution when $r$ success counts are available. Both the case of
$p$ known and the case of $p$ unknown are discussed. The approach is Bayesian
and was first proposed by Draper and Guttman in [2]. The prior probability
distribution for $n$ is assumed to be the discrete uniform distribution. When
$p$ is unknown, $p$ is assumed to have a beta prior probability distribution.
The estimator for $n$ is then the mode of the posterior distribution.

To put the estimation procedure into practice, suggestions on how to select
values of the unknown quantities in the prior probability distributions are
provided. An interactive computer program, written in FORTRAN, is included in
appendix B.

Interval estimation for $n$ is also discussed in this paper. As pointed out
earlier, confidence coefficients are not exact because $n$ is discrete.
Furthermore, confidence intervals are not unique.

Future study may include a simulation to verify whether the proposed procedure
for constructing confidence intervals actually achieves the specified
confidence coefficients. Another topic worth further investigation is the
assessment of the error of the estimators. The bootstrap method may be a useful
tool for that investigation.

CHARACTERISTICS OF BETA DISTRIBUTIONS


Let $Y$ be a random variable having a beta distribution with parameters $v_1$
and $v_2$. The probability density function of $Y$ is given by

$$ f(y) = \begin{cases}
     \dfrac{\Gamma(v_1 + v_2)}{\Gamma(v_1)\,\Gamma(v_2)} \, y^{v_1 - 1} (1-y)^{v_2 - 1} , & 0 < y < 1 , \\
     0 , & \text{elsewhere} ,
   \end{cases} \tag{A-1} $$

where $v_1$ and $v_2$ are positive and $\Gamma(\cdot)$ denotes the gamma
function. The shape of the density $f(y)$ given in equation A-1 depends on the
values of $v_1$ and $v_2$.


Case 1:  $v_1 = v_2 = 1$


$f(y)$ in (A-1) becomes

$$ f(y) = \begin{cases} 1 , & 0 < y < 1 , \\ 0 , & \text{elsewhere} , \end{cases} \tag{A-2} $$

which is the density of the uniform distribution on the unit interval. The
graph of (A-2) is shown in figure A-1.


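Case 2:  $v_1 = v_2 > 1$


Equation A-1 becomes

$$ f(y) = \begin{cases}
     \dfrac{\Gamma(2 v_1)}{[\Gamma(v_1)]^{2}} \, [\, y (1-y) \,]^{v_1 - 1} , & 0 < y < 1 , \\
     0 , & \text{elsewhere} .
   \end{cases} \tag{A-3} $$

The function $f(y)$ in expression A-3 is symmetric with respect to the line
$y = 1/2$. The mode occurs at $y = 1/2$. The value of the density at the mode
is

$$ f(1/2) = \frac{\Gamma(2 v_1)}{[\Gamma(v_1)]^{2}} \left( \frac{1}{4} \right)^{v_1 - 1} . $$

Graphs of (A-3), with various values of $v_1$, are shown in figure A-2.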


FIG. A-1:  $v_1 = v_2 = 1$


Case 3:  $v_2 > v_1 > 1$


Equation A-1 is not symmetric. The mode occurs at

$$ y = (v_1 - 1) / [\, (v_1 - 1) + (v_2 - 1) \,] . $$

In this case, the mode $y$ is less than one-half. The density shows a positive
skewness; in other words, the graph has a long tail to the right. Graphs are
presented in figure A-3 for various values of $v_1$ and $v_2$ with the mode
equal to 0.2.
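The mode formula can be checked numerically. The Python sketch below evaluates
the density of equation A-1 on a grid and compares the grid maximizer with the
closed-form mode; the function names `beta_pdf` and `beta_mode` and the grid
spacing of 0.001 are illustrative assumptions.

```python
from math import exp, lgamma, log


def beta_pdf(y, v1, v2):
    """Density of equation A-1, evaluated through log-gamma for stability."""
    if not 0.0 < y < 1.0:
        return 0.0
    log_norm = lgamma(v1 + v2) - lgamma(v1) - lgamma(v2)
    return exp(log_norm + (v1 - 1.0) * log(y) + (v2 - 1.0) * log(1.0 - y))


def beta_mode(v1, v2):
    """Mode (v1 - 1) / [(v1 - 1) + (v2 - 1)], valid for v1 > 1 and v2 > 1."""
    return (v1 - 1.0) / ((v1 - 1.0) + (v2 - 1.0))


# Grid check for v1 = 5, v2 = 17, whose mode should be 0.2.
grid = [k / 1000.0 for k in range(1, 1000)]
grid_mode = max(grid, key=lambda y: beta_pdf(y, 5, 17))
```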

Case 4:  $v_1 > v_2 > 1$

Case 4 is the reverse of case 3. The mode is larger than one-half. The density
shows a negative skewness; therefore, it has a long tail to the left. Graphs of
beta distributions in this case are the mirror images, with respect to the line
$y = 1/2$, of those given in case 3 and shown in figure A-3.


C     BAYESIAN ESTIMATION OF n IN A BINOMIAL DISTRIBUTION
C
C     PROGRAM INPUT   1). OBSERVATIONS FOR RANDOM VARIABLE X(J)
C                     2). PROBABILITY OF SUCCESS (PRSUCC)
C                     3). SHAPE PARAMETER FOR PRIOR BETA DISTRIBUTION (V1)
C     PROGRAM OUTPUT  1). POSTERIOR PROBABILITIES
C                     2). NUMBER OF TRIALS (n)
C
C     VARIABLE DEFINITIONS
C
C     GAMMA     THE NUMBER OF SUCCESS COUNTS (NUMBER OF X(J)'S)
C     QSUM      SUM OF Q
C     POSTPROB  COMPUTED POSTERIOR PROBABILITY
C     PRSUCC    PROBABILITY OF SUCCESS (BINOMIAL)
C     TSUM      TOTAL NUMBER OF SUCCESSES (SUM OF X(J)'S)
C     V1        SHAPE PARAMETER OF BETA PRIOR DISTRIBUTION
C     V2        SHAPE PARAMETER OF BETA PRIOR DISTRIBUTION
C     XJ        OBSERVATION OF RANDOM VARIABLE X(J)
C               (NUMBER OF OBJECTS DETECTED)
C     XJMAX     MAXIMUM VALUE OF RANDOM VARIABLE X(J)
C     MODE      MODE OF PROBABILITY DISTRIBUTION.  THIS IS
C               THE n VALUE THAT HAS THE MAXIMUM PROBABILITY
C     LSUM      SUM OF PROBABILITIES IN LOWER TAIL OF DISTRIBUTION
C               FOR A 95% CONFIDENCE INTERVAL
C     USUM      SUM OF PROBABILITIES IN UPPER TAIL OF DISTRIBUTION
C               FOR A 95% CONFIDENCE INTERVAL
C     LBOUND    LOWER BOUND OF CONFIDENCE INTERVAL FOR n
C     UBOUND    UPPER BOUND OF CONFIDENCE INTERVAL FOR n
C


C     DATA DECLARATION / INITIALIZATION
C
      INTEGER XJ(10), XJMAX, XVALUE, GAMMA, TSUM, N(1000), UBOUND
      DOUBLE PRECISION Q(1000), QSUM, POSTPROB(1000), MODE, LSUM, USUM
      CHARACTER * 1 KE
C
      DELTA = .005
      XJMAX = 0
      TSUM  = 0
      QSUM  = 0.0
      DATA N  /1000*0/
      DATA XJ /10*0/
C

C     ACCEPT XJ'S FROM INPUT DEVICE AND COMPUTE GAMMA, TSUM, XJMAX
C
      DO 20 I=1,10
   10    PRINT *, ' '
         PRINT *, 'ENTER VALUE FOR THE NUMBER OF OBSERVATIONS IN SAMPLE
     &    X(J) (UP TO 10 SAMPLES) '
         PRINT *, 'PRESS RETURN WHEN ALL SAMPLES HAVE BEEN ENTERED '
         PRINT *, ' '
         ACCEPT 15, XVALUE
   15    FORMAT (I80)
C
         IF (XVALUE .EQ. 0) THEN
            IF (I .GT. 1) THEN
               GOTO 35
            ELSE
               PRINT *, ' '
               PRINT *, 'MUST ENTER AT LEAST ONE VALUE '
               GOTO 10
            ENDIF
         ENDIF
C
         GAMMA = I
         XJ(I) = XVALUE
         TSUM  = TSUM + XVALUE
C
C        DETERMINE MAXIMUM SAMPLE VALUE
C
         IF (XVALUE .GE. XJMAX) XJMAX = XVALUE
   20 CONTINUE
C

C     ACCEPT PROBABILITY OF DETECTION FROM INPUT DEVICE AND EDIT VALUE
C
   35 PRINT *, ' '
      PRINT *, 'ENTER PROBABILITY OF SUCCESS '
      PRINT *, '(MUST BE .1, .2, .3, .4, OR .5) '
      PRINT *, ' '
      ACCEPT 40, PRSUCC
   40 FORMAT (F80.2)
C
      IF ((PRSUCC .EQ. .1) .OR.
     &    (PRSUCC .EQ. .2) .OR.
     &    (PRSUCC .EQ. .3) .OR.
     &    (PRSUCC .EQ. .4) .OR.
     &    (PRSUCC .EQ. .5)) THEN
         GOTO 41
      ELSE
         PRINT *, ' '
         PRINT *, 'PROBABILITY OF SUCCESS MUST BE .1, .2, .3, .4, OR .5'
         GOTO 35
      ENDIF
C
   41 PRINT *, ' '
      PRINT *, 'IS PROBABILITY OF SUCCESS KNOWN (K) OR ESTIMATED (E)'
      PRINT *, 'ENTER K OR E'
      PRINT *, ' '
      ACCEPT 42, KE
   42 FORMAT (A1)
C
      IF ((KE .EQ. 'E') .OR. (KE .EQ. 'e')) THEN
         GOTO 45
      ELSEIF ((KE .EQ. 'K') .OR. (KE .EQ. 'k')) THEN
         GOTO 500
      ELSE
         PRINT *, ' '
         PRINT *, 'MUST ENTER K OR E '
         GOTO 41
      ENDIF
C


C     COMPUTE PARAMETER V2 FOR BETA PRIOR DISTRIBUTION.  A VALUE FOR
C     SHAPE PARAMETER V1 (FLOATING POINT VALUES 2 THROUGH 10) IS
C     ACCEPTED FROM THE INPUT DEVICE.  A LARGER VALUE OF V1 INDICATES
C     GREATER CERTAINTY FOR THE SELECTED PROBABILITY OF SUCCESS.  V2 IS
C     THEN COMPUTED AS A FUNCTION OF V1 BASED ON THE FOLLOWING
C     EQUATION:  (V1-1)(1-p) - (V2-1)(p) = 0,  WHERE p IS THE
C     PROBABILITY OF SUCCESS.
C
   45 PRINT *, ' '
      PRINT *, 'ENTER SHAPE PARAMETER FOR PRIOR BETA DISTRIBUTION'
      PRINT *, '(PARAMETER MUST TAKE ON FLOATING POINT VALUES BETWEEN 2.
     &0 AND 10.0) '
      PRINT *, ' '
      PRINT *, 'LARGER VALUES INDICATE GREATER CERTAINTY FOR THE SELECT
     &ED '
      PRINT *, 'PROBABILITY OF SUCCESS. '
      PRINT *, ' '
      ACCEPT 50, V1
   50 FORMAT (F80.2)
C
      IF (V1 .GE. 2.0 .AND. V1 .LE. 10) THEN
         CONTINUE
      ELSE
         PRINT *, ' '
         PRINT *, 'SHAPE PARAMETER MUST TAKE ON VALUES BETWEEN 2.0 AND 1
     &0.0'
         GOTO 45
      ENDIF
C
      IF (PRSUCC .EQ. 0.1) THEN
         V2 = (9.0 * V1) - 8.0
      ELSEIF (PRSUCC .EQ. 0.2) THEN
         V2 = (4.0 * V1) - 3.0
      ELSEIF (PRSUCC .EQ. 0.3) THEN
         V2 = ((7.0 * V1) - 4.0) / 3.0
      ELSEIF (PRSUCC .EQ. 0.4) THEN
         V2 = ((3.0 * V1) - 1.0) / 2.0
      ELSE
C ***    PRSUCC = 0.5 (THE EQUATION ABOVE THEN GIVES V2 = V1)
         V2 = V1
      ENDIF
C

C     COMPUTE ESTIMATED PROBABILITY CASE
C
C     COMPUTE Q(I) AND QSUM.  COMPUTATIONS ARE TERMINATED
C     WHEN Q(I)/QSUM IS LESS THAN OR EQUAL TO DELTA.
C
C     Q(I) IS THE PRODUCT OF THE FOLLOWING THREE FACTORS
C
C     1). Q(I-1)
C
C     2). THE PRODUCT FROM J=0 TO GAMMA-1 OF
C            (GAMMA*XJMAX-TSUM+V2+((I-1)*GAMMA)+J)
C          / (GAMMA*XJMAX+V1+V2+((I-1)*GAMMA)+J)
C
C     3). ((XJMAX+I)**GAMMA) / THE PRODUCT FROM J=1 TO GAMMA OF
C            (XJMAX-XJ(J)+I)
C
C     LET CONSTANT A = GAMMA*XJMAX-TSUM+V2
C     LET CONSTANT B = GAMMA*XJMAX+V1+V2
C
      A = FLOAT(GAMMA)*FLOAT(XJMAX)-FLOAT(TSUM)+V2
      B = FLOAT(GAMMA)*FLOAT(XJMAX)+V1+V2
      Q(1) = 1.0
      DO 200 I=1,9999
         ISTORE = I+1
         N(I) = XJMAX+I-1
         IF (N(I) .EQ. 129) GOTO 205
C
C        COMPUTE FACTOR 1
C
         IF (I .GT. 1) THEN
            FACTR1 = Q(I)
         ELSE
            FACTR1 = 1.0
         ENDIF
C
C        COMPUTE FACTOR 2
C
         FACTR2 = 1.0
         DO 105 J=0,GAMMA-1
            FACTR2 = FACTR2 * (A+((I-1)*GAMMA)+J)/(B+((I-1)*GAMMA)+J)
  105    CONTINUE
C
C        COMPUTE FACTOR 3
C
         PRODCT = 1.0
         DO 110 J=1,GAMMA
            PRODCT = PRODCT * (FLOAT(XJMAX)-FLOAT(XJ(J))+FLOAT(I))
  110    CONTINUE
         FACTR3 = (FLOAT(XJMAX+I)**GAMMA)/PRODCT
C
C        COMPUTE Q(I) AND QSUM
C
         Q(I+1) = FACTR1*FACTR2*FACTR3
         QSUM = QSUM + Q(I)
C
C        TEST TERMINATION CRITERIA
C
         IF (Q(I)/QSUM .GT. DELTA) THEN
            CONTINUE
         ELSE
            GOTO 205
         ENDIF
  200 CONTINUE
C
C     COMPUTE POSTERIOR PROBABILITY DISTRIBUTION.
C
  205 DO 300 I=1,ISTORE
         POSTPROB(I) = Q(I)/QSUM
  300 CONTINUE
      GOTO 702
C

C     COMPUTE KNOWN PROBABILITY CASE
C
C     COMPUTE Q(I) AND QSUM.  COMPUTATIONS ARE TERMINATED
C     WHEN Q(I)/QSUM IS LESS THAN OR EQUAL TO DELTA.
C
C     Q(I) IS THE PRODUCT OF THE FOLLOWING THREE FACTORS
C
C     1). Q(I-1)
C
C     2). (1-P)**GAMMA
C
C     3). ((XJMAX+I)**GAMMA) / THE PRODUCT FROM J=1 TO GAMMA OF
C            (XJMAX-XJ(J)+I)
C
  500 Q(1) = 1.0
      DO 600 I=1,9999
         ISTORE = I+1
         N(I) = XJMAX+I-1
         IF (N(I) .EQ. 120) GOTO 605
C
C        COMPUTE FACTOR 1
C
         IF (I .GT. 1) THEN
            FACTR1 = Q(I)
         ELSE
            FACTR1 = 1.0
         ENDIF
C
C        COMPUTE FACTOR 2
C
         FACTR2 = (1.0 - PRSUCC)**GAMMA
C
C        COMPUTE FACTOR 3
C
         PRODCT = 1.0
         DO 510 J=1,GAMMA
            PRODCT = PRODCT * (FLOAT(XJMAX)-FLOAT(XJ(J))+FLOAT(I))
  510    CONTINUE
         FACTR3 = (FLOAT(XJMAX+I)**GAMMA)/PRODCT
C
C        COMPUTE Q(I) AND QSUM
C
         Q(I+1) = FACTR1*FACTR2*FACTR3
         QSUM = QSUM + Q(I)
C
C        TEST TERMINATION CRITERIA
C
         IF (Q(I)/QSUM .GT. DELTA) THEN
            CONTINUE
         ELSE
            GOTO 605
         ENDIF
  600 CONTINUE
C
C     COMPUTE POSTERIOR PROBABILITY DISTRIBUTION.
C
  605 DO 610 I=1,ISTORE
         POSTPROB(I) = Q(I)/QSUM
  610 CONTINUE
C


C     GENERATE AND DISPLAY REPORT ON CRT
C
  702 WRITE (6,705)
  705 FORMAT (1H0,'BAYESIAN ESTIMATION OF n IN A BINOMIAL DISTRIBUTI
     &ON')
      WRITE (6,710) PRSUCC
  710 FORMAT (1H0,'PROBABILITY OF SUCCESS (ESTIMATE) = ',F3.1)
      IF ((KE .EQ. 'E') .OR. (KE .EQ. 'e')) THEN
         WRITE (6,715) V1, V2
  715    FORMAT (1H ,'SHAPE PARAMETERS FOR PRIOR BETA   V1 = ',F5.2,
     &           '  V2 = ',F5.2)
      ENDIF
      DO 730 I=1,10
         IF (XJ(I) .EQ. 0) THEN
            GOTO 730
         ELSEIF (I .LE. 9) THEN
            WRITE (6,720) I, XJ(I)
  720       FORMAT (1H ,'NUMBER OF OBSERVATIONS IN SAMPLE X(',I1,
     &              ') = ',I2)
         ELSE
            WRITE (6,725) I, XJ(I)
  725       FORMAT (1H ,'NUMBER OF OBSERVATIONS IN SAMPLE X(',I2,
     &              ') = ',I2)
         ENDIF
  730 CONTINUE
C
      DO 733 I=1,9999
         IF (POSTPROB(I) .GT. POSTPROB(I+1)) THEN
            IMODE = I
            MODE = POSTPROB(I)
            GOTO 734
         ENDIF
  733 CONTINUE
C
  734 WRITE (6,735)
  735 FORMAT (1H0,'    n     p(n:X,p)')
      WRITE (6,740)
  740 FORMAT (1H ,'  ------------------')
      DO 750 I=IMODE-5, IMODE+5
         WRITE (6,745) N(I), POSTPROB(I)
  745    FORMAT (1H ,2X,I3,4X,F7.6)
  750 CONTINUE
C

C     GENERATE 95% CONFIDENCE INTERVAL FOR n
C
      LSUM = 0.0
      DO 770 I=1,9999
         LSUM = LSUM + POSTPROB(I)
         IF (LSUM .LE. 0.025) THEN
            LBOUND = N(I+1)
            GOTO 770
         ELSE
            LSUM = LSUM - POSTPROB(I)
            GOTO 780
         ENDIF
  770 CONTINUE

N  -  ISTORE'2 

  780 USUM = LSUM
      DO 790 I=ISTORE-2, 1, -1
         USUM = USUM + POSTPROB(I)
         IF (USUM .LE. 0.05) THEN
            UBOUND = N(I-1)
            GOTO 790
         ELSE
            GOTO 800
         ENDIF
  790 CONTINUE
C
  800 WRITE (6,810) N(IMODE)
  810 FORMAT (1H ,/,' MODE OF DISTRIBUTION IS n = ',I3)
      WRITE (6,820) LBOUND
  820 FORMAT (1H0,'LOWER BOUND OF 95% CONFIDENCE INTERVAL = ',I3)
      WRITE (6,830) UBOUND
  830 FORMAT (1H ,'UPPER BOUND OF 95% CONFIDENCE INTERVAL = ',I3)
      PRINT *, ' '
      END


